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Abstract 

The Hausdorff fractal dimension has been a fast-to-calculate method to es¬ 
timate complexity of fractal shapes. In this work, a modified version of 
this fractal dimension is presented in order to make it more robust when 
applied in estimating complexity of non-fractal images. The modified Haus¬ 
dorff fractal dimension stands on two features that weaken the requirement 
of presence of a shape and also reduce the impact of the noise possibly pre¬ 
sented in the input image. The new algorithm has been evaluated on a set 
of images of different character with promising performance. 

Keywords: Complexity, Fractal Dimension, Hausdorff Fractal Dimension, 
Illustrated Manuscripts 


1. Introduction 

The scale parameter plays an important role in image processing and 
understanding [5, 6]. However, it seems that the complexity parameter 
could be leveraged in order to improve the image processing methods. In 
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many other fields, fractals have been extensively used to represent and also 
to compress patterns [8]. To differentiate between fractals based on their 
degree of complexity without requiring a full model, a parameter called 
the Fractal Dimension (FD) has been commonly used [10-12, 14, 16, 17]. 
In particular, the Hausdorff Fractal Dimension (HFD)^ [7] has been used 
in many applications. A simpler form of the Hausdorff fractal dimension is 
the Minkowski-Bouligand dimension or the Box-Counting Dimension (BCD) 
[3, 9]. In this work, from here on, we use the BCD’s definition as that of the 
HFD. 

Although the HFD measure is practical and fast to calculate, we ar¬ 
gue that it could be improved by considering two features. Although these 
features will be discussed in greater details in Section 4, they are briefly 
mentioned here. The first feature promotes incorporating ‘non’-object data 
in the calculations. It seems that this has been implicitly considered in the 
definition of the HFD for fractals^ this could be easily violated when apply¬ 
ing the HFD to estimate the complexity of images that are not by nature 
fractals. The second proposed feature is a scholastic process to rule out as 
much as possible of those boxes that are noise-related. Again although this 
feature may be of no-impact in the case of true fractals, it could improve 
the estimations for the degraded, noisy actual images. 

The paper is organized as follows. In Section 2, some of basic notations 
are defined. Then, the baseline definition of the HFD is presented in Section 
3. This is followed by the proposed definition of a Modified HFD (MHFD) 
in Section 4. The illustrative examples are provided in Section 5. Finally, 


^The HFD is a divider approach to estimating a fractal dimension [2, 7]. It has been 
also referred to as the Hausdorff-Besicovitch dimension or the Richardson dimension. 
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the conclusions are presented in Section 6. 


2. General Notation 

In this section, the notation used in the following sections is presented. 

• /: The observed image (or patch): 

T _ { T \n,m _ j 

where m and n are the sizes of the image, and lij is an image pixel 
value at pixel position (i, j). In this work, it is assumed that the image 
pixel values are binary: lij G {O, l}, where 1 is the value of an ‘object’ 
pixel. For the purpose of reducing the ink use in print, the images may 
be shown in either BWOl or BWIO protocols^ [4] depending on the 
ratio of the object to non-object pixels. 

• Dj: The fractal dimension of an image /. 

• B (i, j, vo ): A square ‘box’ or patch at the pixel position (i, j) and with 
the patch width w. A particialut pixel (A:,/) in S(i,j,re) is dented 

In terms of the notations in [6], a box B(i^j^w) could be 
approximated to a patch P(^iJ^^w/2]j-\-[w/2]),[w/2],oo' 


3. The HausdorfF Fractal Dimension (HFD) 

Let’s consider a binary image I. Then, the S-tuple of possible box 
sizes, 5/, is defined as the integer interval [0,5], where 2^ > max(m,n): 

^The BWOl protocol means that the 0 pixels are shown in black while the 1 pixels are 
shown in white [4]. 
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Algorithm 1 Calculate the HFD using the discrete ^-tuples 5/ and A//. 

1: procedure HFD/ = HFDCalculate(/n^^) 

2: > First, calculate Sj and Mi 

3: S ^ max (log 2 m, log 2 n) 

4: parfor 5 = 0, • • • , S do 

5: Bs ^ |^(’5 *5 2^) B are not overlapping, 1) G B s.t. B^^i = 

6: Ns ^ the cardinal number of Bs , i.e., card(Ss) 

7: Populate Sj using the s value 

8: Populate A// using the Ns value 

9: t> Calculate HFDj using the least square regression 

10: Choose a linear regression model: Mji ^ D* • 

11: Choose a least square regression operator: i^LS 

12: Apply the i^LS operator to (^Mr, log (A//), log (l/2‘^^) ^ to get D 

13: HFD/ ^ D 


Si = For every box size 2^ with an s value from 5/, the number of 

‘non-overlapping’ boxes with at least one object-pixel is denoted Ng. The 
S-tuple of all Ns is denoted as A// = (A^5)f^o* Starting with the definition 
of the Haussdorf Fractal Dimension (HFD): 


F)/ = HFD/ = 


log (1/2®) 


( 1 ) 


and because of the discrete and finite nature of the 5/ and A//, an extrapo¬ 
lation could be instead used: 


Di = RFDi = argmini?Ls(log(7V.) = i?*log(l/2^) + /i*j, (2) 

s.t. Ns G A//, 5 = 0, • • • ,5, 


^The lower limit of s = 0 could be pushed down toward lower values using super 
resolution enhancement of the image /. 
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where i?LS stands for a Least Square regression operator. The complete 
algorithm to calculate the HFD is provide in Algorithm 1.^ 


Algorithm 2 Calculate the proposed MHFD. 
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procedure MHFD/ = MHFDCalculate(/n^^) 

> Optional preprocessing 
(Optional) / ^ Denoised I 
(Optional) I ^ Edges of / 

(Optional) / ^ Skeleton of / 

> Then, calculate the new Sj and A/} 

S ^ max (log 2 m, log 2 n) + 1 
parfor 5 = 0, • • • , S' do 


Bg ^ 
Bs <— 


B, ^ 


|h(', •, 2 ^) ^Bs are not overlapping, 3{k, 1) e B s.t. B^^i = 1^ 
B G Bs, 3(/c, 1) G B s.t. Bk^i = o| 

B e B„U{0,caxd{B = 1) + l) - X 9 a;B > l| 


Ns ^ the cardinal number of Bg , i.e., card 


(s,) 


Populate Si using the s value 
Populate A// using the Ng value 
> Calculate MHFDi using the least square regression 
Choose a linear regression model: Mr ^ D* • 

Choose a least square regression operator: i^LS 

Apply the i^LS operator to (^Mr, log (^A//^ , log ^1/2*^^^ ^ to get D 

D ^ cD where c is a normalization factor 
MHFD/ D 


In the next section, a modified HFD is presented. 


4. The Modified HausdorfF Fractal Dimension (MHFD) 

As mentioned in the Introduction section, the input image / may not 
completely satisfy the basic requirements of a fractal shape. To address this 
challenge, we introduce two features: 


^An implementation can be found here: http://www.mathworks.com/matlabcentral/ 
fileexchange/30329-hausdorff--box-counting--fractal-dimension. 
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1. Explicit Valuation of Non-object Pixels. Only boxes that at 
least contain a 0 pixel are counted in. 

2. Probabilistic Discard of Noise Boxes. For boxes with a small 
number of 1 pixels, there should be higher chance that they are ex¬ 
cluded from the count. 

To impose these two features, the definition of Ng is revised: For every s 
and for every non-overlapping box B of patch size 2^, the number of 1 and 
0 pixels are calculated using the Integral Image representation [1, 15]. Let’s 
assume these numbers are ub^i and respectively. If ub^o — 0, the box 
is immediately discarded. Otherwise, a dice of ub^i + I faces is rolled, and if 
the outcome is the face showing 1, the box is discarded. Finally, for every 5, 
only those boxes that have not been discarded are counted in to calculate the 
new Ng. The MHFD can be calculated in a way similar to that of Equation 
( 2 ): 


Di = MHFD/ = argmin i?LS (log (iV, j = D* log (1/2®) + h*j , (3) 

s.t. Ng G A//, 5 = 0, • • • ,5. 

The details of the MHFD calculations are provided in Algorithm 2.^ Please 
note that we also considered optional preprocessing, edge extraction, and 
skeleton extraction steps in the algorithm that could improve the perfor¬ 
mance. This is further discussed in Section 5. 
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(a) 

HFD = 1.6809 

MHFD = 1.2876 




(c) 


HFD = 0.9259 
MHFD = 0.8242 



MHFD = 1.3626 



MHFD = 0.3101 


Figure 1: a) and b) Two synthesized images with their associated HFDs 
and MHFDs. c) and d) The preprocessed images of (a) and (b) and their 
associated HFDs and MHFDs. 
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5. The Illustrative Examples 


We start with two synthesized images, as shown in Figure 1. Figure 
1(a) shows an image with a low level of complexity. Using the algorithm 
of Section 3, a high HFD is obtained (HFD = 1.6809). In contrast, the 
MHFD gives a much lower dimension. If the edge of the image is used, 
which is equivalent to extracting the fractal shape, both HFD and MHFD 
provide low dimensions around 0.9 that better represent the complexity of 
the image. The second synthesized image is a binary salt-and-pepper noise 
(Figure 1(b)). Again the HFD give a high dimension (HFD = 1.8234) while 
the MHFD is able to show more resiliency to the noise. When preprocessing 
and skeletonization are used (Figure 1(d)), again both the HFD and MHFD 
provide a correct low dimension of 0.3. 

In the second example, some fractal images are considered. This exam¬ 
ples are shown in Figure 2. It is worth mentioning that the normalization 
factor of Algorithm 2 has been chosen in such a way that the MHFD of Fig¬ 
ure 2(a) matches its analytic value. Also, in all cases, the edge extraction 
step has been used. Again, a better robustness can be observed in the values 
of the MHFD. 

As the final example, some illustrated alphabetical letters from historical 
manuscripts are considered.^ In particular. Figures 3(a) and 3(b) are Lit- 
terae Notabiliores^ i.e., enlarged letters within a text, designed to clarify the 
syntax of a passage. In contrast, two other examples, shown in Figures 3(c) 


Matlab implementation has been provided here: http://www.mathworks.com/ 
mat labceiitral/fileexchange/50790-modified-haussdorf-fractal-dimension. 

®More specifically, from a manuscript entitled “Collectio decern partium (expansion of 
Ivo of Chartres, Panormia).” More information at: http://parkerweb.stanford.edu/ 
parker/actions/manuscript_description_long_display.do?ms_no=94. 






HFD = 1.5999 HFD = 1.8811 

MHFD = 1.5843 MHFD = 1.6794 



(c) Sample 3.^ 



(f) 

HFD = 1.7129 
MHFD = 1.6263 


Figure 2: Three fractal images and their associated HFDs and MHFDs. 
a)-c) Input images. d)-f) The edge images used to calculate the MHFD. 
Notes: Link: http: //4. bp. blogspot. com/-aHCf mDvyzFU/Un_U- 

Neo_GI/AAAAAAAAGpQ/DWzjztkh4HM/sl600/sierpinski.png. ^ Link: 

http://upload.wikimedia.org/wikipedia/commons/thumb/a/aO/ 
Sierpinski_carpet.png/480px-Sierpinski_carpet.png. § Link: 

http://www.math.upenn.edu/~pstorm/images/round_Sierpinski_ 
carpet.png. 
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(e) 

HFD = 1.5380 

MHFD = 1.4357 



(i) 

HFD = 1.5804 

MHFD = 1.5078 



(m) 

HFD = 1.3731 

MHFD = 1.2944 



(b) 



(f) 

HFD = 1.6135 

MHFD = 1.4930 



j) 

HFD = 1.6091 

MHFD = 1.5335 



HFD = 1.4069 

MHFD = 1.3248 



(g) 

HFD = 1.3943 

MHFD = 1.2692 



(k) 

HFD = 1.5482 

MHFD = 1.4472 



HFD = 1.1629 

MHFD = 1.0619 



(h) 

HFD = 1.3983 

MHFD = 1.2111 



( 1 ) 

HFD = 1.3863 

MHFD = 1.2794 



(P) 

HFD = 1.0864 

MHFD = 0.9594 


Figure 3: a) and b) Samples of Type I {Littera Notabilior) objects, c) and 
d) Samples of Type II (Enlarged Capital) objects. e)-li) Simple binarization 
of (a) to (d) and their associated HFDs and MHFDs. i)-l) The same as (e) 
to (h) but with edge extraction. m)-p) Preprocessed and skeletonization of 
(a) to (d) and their associated HFDs and MHFDs. 
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and 3(d), are regular enlarged capitals. It could be easily observed that the 
Littera Notabilior class shows a higher level of complexity compared to the 
enlarged capital class. Therefore, it could be argued that a fractal dimension 
analysis would be capable to differentiate between these two classes. Three 
alternative cases are considered: i) A simple thresholding algorithm [13] is 
applied to the input images and then the HFD and MHFD are calculated 
(Figures 3(e)-(h)), ii) the edges of the binary images are used in the calcula¬ 
tions (Figures 3(i)-(l)), and hi) the color images are converted to gray using 
the min-average color-to-gray transform [4] and then the skeleton image is 
used in the calculations (Figures 3(m)-(p)). As can be seen from the values 
calculated using the HFD and MHFD, the MHFD provides a better robust¬ 
ness. In particular, for the edge images, i.e.. Figures 3(i)-(l), the average 
intra-class distance is reduced from 0.095 to 0.075 when switching from the 
HFD to the MHFD, while the inter-class distance is increased from 0.125 to 
0.155, respectively. This could be interpreted as an increase of 57% in the 
class differentiability if the MHFD is used, which could be crucial for images 
that fall in the border of the two classes. We will evaluate the performance 
of the measure using a bigger dataset in the future. 

Finally, we would like to present a preliminary analysis of the other 
impacts of the MHFD. It can be easily argued that even for true fractal 
shapes the MHFD discards a high ratio of boxes for s values near 1, which 
correspond to high values of the box-per-pixel resolutions, i.e., 1/2^. To be 
specific, for single-pixel boxes, which correspond to 5 = 0 and a box-per- 
pixel resolution of 1, 50% of potential boxes are on average discarded. A 
less sever effect is expected for lower resolutions. This phenomenon, which 
can be imagined as a Meffection’ effect and shown in Figure 4, would weaken 
applicability of the Hausdorff model (1). To contain the impact of deffection. 
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Figure 4: a) The behavior of the discrete 5/ and Mi associated to the 
HFD and also that of Si and Mi associated to the MHFD. The values 
correspond to the image shown in Figure 2(a). The deflection introduced by 
the MHFD for high values of the high box-per-pixel resolution can be easily 
observed, b) A comparison between linear regression of Equation (2) and 
weighted nonlinear regression of Equation (4) in fitting the MHFD data. The 
nonlinear model provides a better fit especially for the high box-per-pixel 
resolutions. 
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we propose to use a modified nonlinear^ model as follows: 


= = argmini?Ls(log(iV,) = + 

f JV, € = ,S, 

1 = ML’ 

where Wg is the weight vector to be considered in the least square minimiza¬ 
tion. Figure 4(b) shows the performance of the proposed weighted nonlinear 
regression. The green curve, which corresponds to Equation (4), provides 
a better fit especially for low s (high box-per-pixel resolution) values. We 
further investigate this aspect in the future. 

6 . The Conclusions 

A modified box-counting Hausdorff fractal dimension has been intro¬ 
duced in order to apply it in complexity analysis of binary images. The 
core of the modification is based on two features that weaken the require¬ 
ment of presence of a shape and at the same time discard potential noise 
boxes in a probabilistic way. In addition, preprocessing of the input images 
along with edge or skeleton extraction has been considered. The proposed 
method has been tested on noise images, fractal shapes, and also illustrated 
manuscript images. In all cases, the modified dimension showed robustness 
even in absence of edge or skeleton extraction steps. 

In the future, the performance on a bigger dataset of manuscript im¬ 
ages will be evaluated. In addition, irregular pooling of boxes to weaken 
the non-overlapping consideration toward a more robust estimation will be 

^in the sense of the log values of Si and A/}. 
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considered. Finally, nonlinear regression models will be considered in order 
to better fit and absorb the deflection introduced by the proposed method 
in small box sizes. 
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